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A NOTE ON THE MAXIMAL OUT-DEGREE OF GALTON-WATSON 

TREES 

XIN HE 


Abstract. In this note we consider both the local maximal out-degree and the global maximal 
out-degree of Galton-Watson trees. In particular, we show that the tail of any local maximal 
out-degree and that of the offspring distribution are asymptotically of the same order. However 
for the global maximal out-degree, this is only true in the subcritical case. 


1. Introduction 

In a Galton-Watson tree (GW tree), we call the number of offsprings of a vertex the out-degree 
of this vertex. Then naturally by maximal out-degree of the GW tree we mean the maximal 
number of offsprings of all the vertices in the tree. In [3] we have proposed a new way to 
condition random trees, that is, condition random trees to have large maximal out-degree. Then 
under this new conditioning, we studied local convergence of critical or subcritical GW trees. 
Theorem 3.3 in [3] states that in the subcritical case, tail of the offspring distribution and tail 
of the maximal out-degree are asymptotically equivalent, apart from the ratio 1 — /u, where g 
is the expectation of the offspring distribution. This result seems to be interesting on its own, 
although in [4) it is essential for the proof of local convergence in the subcritical case. 

Motivated by [4], later in [5] we have systematically studied the maximal jump of continuous- 
state branching processes (CB processes). Note that here the maximal jump of CB processes 
actually corresponds to the maximal out-degree of GW trees, since vertices of Levy trees do 
not branch at the same time, after binary branches are excluded from the consideration. In 
particular, we have distinguished the local maximal jump and the global maximal jump , which 
are the maximal jump over a finite time interval and the whole time interval, respectively. 
Several exact expressions and explicit asymptotics of both the local maximal jump and the 
global maximal jump of CB processes are obtained. An interesting phenomenon there is that, 
the tail of any local maximal jump and that of the Levy measure are asymptotically of the same 
order. However for the global maximal jump, this is only true in the subcritical case. Naturally 
one wants to ask: Is there a similar phenomenon in the setting of GW trees? The answer is yes, 
and in this note we present this phenomenon in the setting of GW trees. 

Now let us explain our results. Lemma 12.11 gives an expression of the finite-dimensional 
distribution of the maximal out-degrees at different generations. For the maximal out-degree at 
a certain generation, Proposition 12.21 shows that its tail and that of the offspring distribution 
are asymptotically of the same order. For the local maximal out-degree, which is the maximal 
out-degree over a finite time interval, Theorem 12.31 shows a similar situation. Note that Theorem 
12.31 can actually be proved by the method used in the proof of Theorem 3.3 in |4j, however in 
this note we use a slightly different method based on mean value theorem and Taylor’s theorem, 
which we feel is more revealing, among several advantages . Then we consider the global maximal 
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out-degree , which is the maximal out-degree over the whole time interval. In Proposition 12.61 we 
prove Theorem 3.3 in [4] with our new method. The global maximal out-degree in the critical 
case is more subtle than that in the subcritical case. However we can still use our new method 
to get some partial results in Theorem 12.81 

Although our papers HUH] have motivated this note directly, it has also been influenced by 
two interesting papers mm from Bertoin. In fact, mm have also influenced mm in certain 
results and arguments. The topics in mm are the global maximal out-degree in two special 
critical cases and its scaling limit. Our setting in this note is more general, and we consider 
both the local maximal jump and the global maximal jump. However we do not consider the 
scaling limit here, which seems to be the main motivation of mm- 

In the remaining of this introduction, we review some notations of GW trees. In the next 
section, we state and prove our results. 

We use r{p) to denote a GW tree with the offspring distribution p = (po,Pi,P 2 , • • •) on non¬ 
negative integers. We say that p is bounded , if the set {r;p r > 0} is bounded. Use P to denote 
the underlying probability, and p the expectation of p. We assume that p £ (0, oo), so that 
p o < 1. We use (sub)critical to mean critical or subcritical, that is, p < 1. It is well-known that 
r(p) has finite vertices a.s. if it is (sub)critical, and has infinite vertices with positive probability 
if it is supercritical. For this result and more, refer to the standard reference |T] . Following the 
terminology of Galton-Watson processes, we also say that r{p) is extinct if it has finite vertices, 
and is non-extinct if it has infinite vertices. 

At time (or generation) 0, the GW tree r(p ) has a single vertex, the so-called root. At 
generation 0, the root branches into several offsprings according to the offspring distribution 
p. We use M n to denote the maximal number of offsprings of all vertices at generation n, and 
M[o t n\ the maximal number of offsprings of all vertices in the first n + 1 generations, including 
generation 0. Then clearly P [Mo = r] = p r . We also use M = M[ 0iOO ) to denote the supremum 
of all out-degrees of r(p). Clearly in the (sub)critical case M is finite a.s. and it is actually the 
maximum of all out-degrees. Use F(r) to denote the distribution function of p, F{r ) the tail 
function. Similarly we use H (r) to denote the distribution function of M, H (?’) the tail function. 
Note that in the supercritical case, it is possible for M to be infinite. 

We use G(x) to denote the generating function of p ,that is, for x £ [0,1], 

G(x ) = po + pix H-b p n x n H-• 


Clearly G has derivatives of all orders, and 

dG(x ) 


dx 


E = l— - A*' 


We write G 2 (x ) = GG{x) for G(G(x)). By induction, clearly 

dG n {x). 


dx 


C —1 — = H 


Finally for any nonnegative integer r, we use G r (x) to denote the generating function of p 
truncated at r, that is, for x £ [0,1], 

G r {x) = po + pix -b p r x r . 


2. The results 

We begin with an exact expression of the finite-dimensional distribution of the maximal out- 
degrees at different generations. This result is very elementary, however here we give a general 
version with our applications later in mind. 
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Lemma 2.1. Recall that M t is the maximal out-degree at generation i. Then 

P [Mi < r-,,0 <i < n] = G ro G ri ■ ■ ■ G rn ( 1). 

Proof. For n = 0, trivially 

P [M 0 < r 0 ] = p 0 + pi -\ -1-Pro = G'ro(l). 

For n = 1, by considering the out-degree of the root we get 

P [M q < r 0 ,Mi < n] = po+piP [M q < n] 4-h P [M 0 < n ] ro = Gr 0 G ri (l). 

The general case follows by induction and a similar identity 

P [Mi < r,,0 < i < n] = p 0 +piP [M*_i < n, 1 < i < n] H-l-Pr 0 P [Mi -1 < 1 < * < n] r ° . 

□ 


This lemma implies that 

P [M n <r} = G n G r ( 1) = G n (p 0 + ■■■ + Pr ). 

In general, the generating function G has no explicit expression. However, we can get a simple 
asymptotic result of P [M n = r] as r —> oo. 

Proposition 2.2. It is always true that 

P [M n = r\ < p n p r . 

Assume that p is unbounded, then as r —>• oo, 

P [M n = r] ~ p n p r and P [M n > r] ~ p n F(r), 
where the first limit is understood along the infinite subsequence {r;p r > 0}. 


Proof. The second limit is implied by the first one. For the first limit, by mean value theorem 
we have 


Px [M n = r] 


G n (po + ■ ■ ■ + Pr) ~ G n (po + • • • + Pr-l) — pr 


dG n (y) 
dy 


y — a r ? 


where a r is a certain point in the open interval (po + • • • + p r -i,Po + • • • + p r )• The inequality 
is obvious since 


dG n (y) 
dy 


y=a r < 


dG n (y ), 
8y 


\y=l- — h 


For the limit, clearly we have as r —> oo, 


P [M n = r] jp r -A 


dG n (y ) 
dy 


\y=l- ~ h 


□ 


Then we study the local maximal jump over the time interval [0, n], that is, Mr 0)n i. Proposition 
12.21 clearly implies that 

n n 

(1) P [M[ 0 ,n] =r\<J2 P = r}<PrJ2 

i —0 i =0 

We will show that P [M[ 0 , n ] = r] and this upper bound are asymptotically the same, as in 
Proposition 12.21 
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Theorem 2.3. Assume that p is unbounded, then as r —>• oo, 

p [ M [o,n] = r] ~ (1 + p 4- and P [M [0jn ] > r] ~ (1+ /4H-|-At n )F(r), 

where the first limit is understood along the infinite subsequence {r\p r > 0}. 

Proof. The second limit is implied by the first one, so we only need to prove the first one. When 
n = 0, triviallyP [Mq = r] = p r . For n > 0, by Lemma 12.11 and mean value theorem, we have 

P [M [M = r] = GyGy (1) - Gr^G^fil) 

= GV-iG”(1) - GV-iG^l) + p r [G?(l)] r 

= P = r] + Pr-[G,(l)] r , 


where a r is a certain point in the open interval (G”_ 1 (l), G”(l)). From (H|) we know that for 
any n > 0, Mr 0)n i has finite expectation. Consequently for any n > 1, lim r _ > . 00 [G”(l)] r = 1, since 
Gy( 1) = P [Mr 0 j „_ 1 ] < r]. It is also easy to see that for any i?, 


p R = lim 9Gr ^ | < lim 

r—>■ oo (jy r—too 

where p R = Wi- Notice that lirri/j^^ 
Af[o. ri _i] gives that as r —> oo, 


dG r _ \ ( y) 
dy 


i y=ar < lim 


dG(y ) t 

dy 


I y=a r — hi 


p R = p. Finally assuming the statement is true for 


P [M[0,n] — r ] ~ P [-^[0,n-l] — r ] P +Pr ~ {p n + •••+// + l)Pr- 


□ 


Note that the above theorem can also be proved by the method used in the proof of Theorem 
3.3 in [3], which we call the barehands method. Here we used a slightly different method, which 
we call the calculus method. We feel that the most important advantage of the calculus method is 
that it correctly identifies p as the derivative of the generating function G at 1—. The calculus 
method is also more automatic and more efficient, we believe. Now let us use this calculus 
method to get a generalization of both Proposition 12.21 and Theorem 12.31 


Proposition 2.4. Assume that p is unbounded. Consider a sequence of m positive inte¬ 
gers (n,;,l < i < m) such that m < nj for 1 < i < j < m, and another sequence of 
m positive integers (r ni ,l < i < m). Let r = mini <i<m,r ni . Assume additionally that, 
limsup^oQ rj /min j<i< m r ni < oo for any 1 < j < n. Then as r —> oo, 


{ AL ni — r ni , M n . < r n .; 1 < j < m , j i\ 


_i =1 


E 

i= 1 


P Pr n 


where the limit is understood along the infinite subset {(r ni , 1 < i < m): p rn . > 0}. 


Proof. We argue by induction. When m = 1, this is exactly Proposition 12.21 When m = 2, we 
need to prove that as r = r ni A r n2 oo, 


P ni Pr ni + p n2 Pr n 


P {M ni = r nx , M n2 < r n2 } (J {M n2 = r n2 , M m < r m } 

Lemma EH shows that the l.h.s of the above expression is 

G ni G rni G n >- n '- 1 G rn2 ( 1) - G ni G rni _ 1G n2_rai ~ 1 G Tn2 _i (1). 
Then as in Theorem 12.31 we get that as r —> oo, 


Qri! q Q n2 


—n\ — l 


Gr n2 ( 1 ) - G H1 G rni _ i G n2 ~ ni - 1 G rn2 -1 ( 1 ) 
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~ V ni {G rni G n2 ~ ni ~ l Gr n2 ( 1 ) — G rni -iG n2 ~ ni ~ 1 G rn2 -i{l)) 

= H ni (Gr^G^-'Gr^il) -Gr^G^-^Gr^il)) 
+» Ul Pr ni [G^-"i- 1 G rBa (l)]'" 1 
~ P U1+1 {G^-^G^i 1 ) - G^-'Gr^l)) +^ Prni 

~ ,, n 2 n _L 


where in the last step we used the induction for m = 1. Clearly when m > 1, we can always 
reduce the case of m into the case of m — 1 in this way. 

Notice that we then only need to check that lim,.-^ [G n2_ni_1 GV n2 (1)] 7-711 = 1. Recall from 
the proof of Theorem 12.31 that hm rn2 ^ 00 [G™ 2 “ ni (l )] r ' n 2 = 1. Finally lim^^ [Gy 2 ~ rai (l)] rni = 1 

by our assumption, so lim^oo \G n2 ~ ni ~ 1 G Tn2 (1)] r ’ H = 1. The general case can be proved 
similarly. □ 

From now on we study the global maximal out-degree M = Mi 0 oo y We begin with a char¬ 
acterization of the distribution function of M. In the (sub)critical case this result has appeared 
in [3], see page 794-795 of [3|. Here we mainly add the result in the supercritical case. 

Lemma 2.5. Assume that p is supercritical with po + • • • + p r < 1 or (sub)critical, then 

P [M < r] = q [0 ^ r] , 

where < 7 [ 0 r .] is the unique solution of the equation G r (t ) = t for t £ [0,1]. 

Proof. By considering the out-degree of the root we see that 

P [M < r] = po + PiP [M < r] + • • • + p r (P [M < ?’]) r . 

So P [M < r] is a solution of the equation G r (t) = t for t £ [0,1]. 

Then we only need to show the uniqueness of the solution. If p\ + 2p% • • • + rp r < 1, then 
it is easy to see that G' r (t) < 1 for t £ [0,1). Since trivially G r (0) = po > 0 and G r (l) = 
Po + " • + Pr < 1, there is indeed a unique solution of the equation G r (t) = t for t £ [0,1]. 

If pi + 2p 2 • • • + rp r > 1, then clearly p is supercritical and po +Pi < 1- Let q be the smallest 
solution of G(t) = t for t £ [0,1] (see e.g., Section 1.3 in jlj). Trivially G r (t) < G(t ) and 
G' r (t) < G'{t). So G' r {t) < G'(t) < G'{q) < 1 for t < q (again, see Section 1.3 in [T]). Since 
trivially G r (0) = po > 0 and G r {q) < G(q) = q, we see that there is a unique solution of the 
equation G r (t) = t for t £ [0, q]. Since G r (t) < G(t), so if G r (t) = t has a solution for t £ (q, 1], 
then G r ( 1) = 1 = Po + • • • + p r - We are done. □ 

Of course, If p is supercritical with po + • • • +p r = b then trivially P [M < r] = 1. In this case, 
we may just define q[o, r ] = 1- Or we may just say that gro^i is the largest solution of the equation 
G r (t) = t for t € [0,1]. Let q r = gj 0 ,r] — 9[o,r-i] f° r r — 1) an d Qo = 9[o,o]- Then P [M = r\ = q r . 

From now on we study the tail of M. We first treat the subcritical case. Note that this 
result was first proved in Theorem 3.3 of [3], by the barehands method. Here we prove it by the 
calculus method, which we feel is more revealing. Recall that H{r) is the distribution function 
of M and H(r ) the tail function, so H(r ) = m 0ir i. Also recall the simple fact that if po > 0, then 
q r > 0 if and only if p r > 0, see Lemma 3.1 in |3j. 


Proposition 2.6. Assume that p is subcritical and unbounded, then 


lim 

r—>oo 


Hr) 

H(r) 


V P r 1 
lim — = l — /i, 

r—)• oo q r 


where the last limit is understood along the infinite subsequence {r;p r > 0}. 
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Proof. The first limit is implied by the second one. For the second limit, by Lemma [2.5 1 we have 

9[0,r] = G r (?[ 0 , r] ) = G r - l(9[0,r]) +PrQ[( Q >r p 9[0,r-l] = GV-l(<7[0,7— 1])- 

By considering the difference of the these two identities and mean value theorem, we get 


( 2 ) 

where a r is a certain point in 
P [M = 


_ dG r - 1 (y) l , _ 

Qr — Qr Q \y=a r ~r PrQ[ o, r ]> 
the open interval (9[o,r-i]> 9[o,r])- 
r] < P [Mq = r] + P [Mi = r] + • 


Similar to (JT|) , we have 

.. < Pr 
~ 1 -P 


So M has finite expectation, and consequently lim,.^,*, g|[ ) r j = 1. We are done since in Theorem 
12.31 we have proved that 


(3) 


lim 

r—>oo 


dGr-^y), 

8y ly=ar “ M 


□ 


The supercritical case is essentially trivial, since it can be reduced to the subcritical case. Let 
q be the probability of extinction, that is, the smallest solution of G(t) = t for t € [0,1] (see e.g., 
Section 1.3 in jTj). By a well-known result on supercritical GW trees (see e.g., Theorem 1.12.3 
on page 52 of C3)> for any supercritical and unbounded offspring distribution p with po > 0, we 
have 0 < q < 1, 

V — 1 

lim H(r) = 1 — q, and lim — - - = 1 — V iptq * _1 , 

r— >-oo r— too (j r . J 


where the last limit is understood along the infinite subsequence {r;p r > 0}. If po = 0, then 
trivially qoo = 1. 

Finally we treat the critical case. Theorem 12.31 implies that in the critical case, H{r) is 
asymptotically larger than F(r). To prove this, just notice that for any n, 


lim inf 

r—too 


H(r ) 

F(r) 


= lim inf 

r—»• oo 


P [M > r] 
¥{r) 


> 


lim 

r—> oo 


P [ M [ 0 ,n] > r] 
F(r) 


n + 1. 


Apart from this simple observation, generally speaking the critical case is much more subtle 
than the subcritical or supercritical case, as shown by Bertoin in [HE]. However we can still use 
our calculus method to derive several partial results. We first present a simple lemma on q\r t00 y 


Lemma 2.7. Assume that p is (sub)critical, then q \ riOC ) < 1/r for any positive integer r. 


Proof. This follows from a simple inequality on width of GW trees. Use X n to denote the total 
number of nodes of the tree r at generation n and define the width W(r) of the tree r by 
W(t) = sup n>0 X n . Now by optional sampling for supermartingales, we get P [W > r] < 1/r 
for any positive integer r, which in turn implies the desired inequality on f/[ ri00 ). □ 

Note that a part of the following theorem was first appeared in Lemma 1 of [3]. 


Theorem 2.8. Assume that p is critical and unbounded, then for q r we have 

v 3V ^ Pr , v Qr ^ 2 

lim — = oo, q r < — -— —ana lim sup— < 

r^oo oo Pr & 
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where o 2 is the variance of p (could be infinite) and the limits are understood along the infinite 
subsequence {r;p r > 0}. For H(r) similarly we have 

F(r) ^ 2/ 


H(r) 

lim TPT = °°’ H(r)< , 

r^oo F(r) Li>r( l ft) 


, v H 2 (r) 2 

and hm sup - . . < 


r —>oo F(v') (J 2 


Proof. For the first statement, recall ([2]) and (JHJ). By Lemma [27T1 we see that lirriinf r ^oo qL , > 
1 /e. We are done with the first statement since 1 — p = 0 for a critical offspring distribution p. 

For the second and third statements, apply Taylor’s theorem to the difference of H(r — 1) = 
G r -i(H(r — 1)) and H(r) = G r (H(r)) = G r -i(H(r)) +p r H r (r) to get 


Qr — Qr~ 


dGr-xiy). 


dy ly - l ~ 2 Qr 8y 2 1y 

where a r is a certain point in the open interval ( H(r — 1), H(r)). So 

Pr Pr 


1 2 d 2 G r -i(y) 


=a r +p r H r (r), 


q r < 


1 _ 9G r (y) I 
1 dy \y =1 - 


Ei>r(*Pi) 


and 


q 2 


lim sup — < lim sup 

V 71 . d Z Cr r — l[V) I 

r—>oo Pr r—>-oo --— I 


<7* 


~~5p IJ/—°r 

For the last two statements, again apply Taylor’s theorem to the difference of H(r) = 
G r (H(r)) and 1 = G r (l) + F(r) to get 

1 H 2 (F d2Gr{y ^ 

2 


ij, , t- T , N dG r {y) 

H(r) = H{r) iy; 


=1 _-lH 2 (r)^^\ y=ar +F(r), 


dy lw=i “ 2“ v ' dy 2 ' y 
where a r is a certain point in the open interval ( H(r ), 1). So 

F(r) F(r 


H(r) < 


1 dG r (y) | 

1 d y \y=i- 


E i> r (*Pi) 


and 


, tf 2 (r) , 2 

hm sup -, , < hm sup , . 
r^oo F(r) ~ r—>oo 


gp lj/=Or 


a- 


In particular, if the critical offspring distribution p has tail F(r) 
Theorem 12.81 shows that H(r) = o(r _1 ) as r —> oo. 




-2 


as r 


□ 

oo, then 
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